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Abstract 

Coherent network error correction is the error-control problem in network coding with the knowledge of the 
network codes at the source and sink nodes. With respect to a given set of local encoding kernels defining a linear 
network code, we obtain refined versions of the Hamming bound, the Singleton bound and the Gilbert- Varshamov 
bound for coherent network error correction. Similar to its classical counterpart, this refined Singleton bound is tight 
for linear network codes. The tightness of this refined bound is shown by two construction algorithms of linear 
network codes achieving this bound. These two algorithms illustrate different design methods: one makes use of 
existing network coding algorithms for error-free transmission and the other makes use of classical error-correcting 
codes. The implication of the tightness of the refined Singleton bound is that the sink nodes with higher maximum 
flow values can have higher error correction capabilities. 

Index Terms 

Network error correction, network coding, Hamming bound. Singleton bound, Gilbert- Varshamov bound, network 
code construction. 

I. Introduction 

Network coding has been extensively studied for multicasting information in a directed communication network 
when the communication links in the network are error free. It was shown by Ahlswede et al. HI that the network 
capacity for multicast satisfies the max-flow min-cut theorem, and this capacity can be achieved by network coding. 
Li, Yeung, and Cai [2J further showed that it is sufficient to consider linear network codes only. Subsequently, 
Koetter and Medard (|3l developed a matrix framework for network coding. Jaggi et al. [j4J proposed a deterministic 
polynomial-time algorithm to construct linear network codes. Ho et al. H showed that optimal linear network codes 
can be efficiently constructed by a randomized algorithm with an exponentially decreasing probability of failure. 
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Fig. 1. This is a classical error correction example, where s is the source node and t is the sink node. This model is extensively studied by 
algebraic coding. 

A. Network Error Correction 

Researchers also studied how to achieve reliable communication by network coding when the communication 
links are not perfect. For example, network transmission may suffer from link failures fi], random errors Q and 
maliciously injected errors f7^. We refer to these distortions in network transmission collectively as errors, and the 
network coding techniques for combating errors as network error correction. 

Fig. [U shows one special case of network error correction with two nodes, one source node and one sink node, 
which are connected by parallel links. This is the model studied in classical algebraic coding theory Isl, ID, a very 
rich research field for the past 50 years. 

Cai and Yeung ||6|, lITOl . lITTI extended the study of algebraic coding from classical error correction to network 
error correction. They generalized the Hamming bound (sphere-packing bound), the Singleton bound and the Gilbert- 
Varshamov bound (sphere-covering bound) in classical error correction coding to network coding. Zhang studied 
network error correction in packet networks |[T2l, where an algebraic definition of the minimum distance for linear 
network codes was introduced and the decoding problem was studied. The relation between network codes and 
maximum distance separation (MDS) codes in classical algebraic coding ifTsl was clarified in lfT4ll . 

In 161, ifTOll . lim . the common assumption is that the sink nodes know the network topology as well as the 
network code used in transmission. This kind of network error correction is referred to as coherent network error 
correction. By contrast, network error correction without this assumption is referred to as noncoherent network error 
correctioi^ When using the deterministic construction of linear network codes 111, H, the network transmission 
is usually regarded as "coherent". For random network coding, the network transmission is usually regarded as 
"noncoherent". It is possible, however, to use noncoherent transmission for deterministically constructed network 
codes and use coherent transmission for randomly constructed network codes. 

In ifTsl . Yang et al. developed a framework for characterizing error correction/detection capabilities of network 
codes for coherent network error correction. Their findings are summarized as follows. First, the error correc- 
tion/detection capabilities of a network code are completely characterized by a two-dimensional region of parameters 
which reduces to the minimum Hamming distance when 1) the network code is linear, and 2) the weight measure 
on the error vectors is the Hamming weight. For a nonlinear network code, two different minimum distances are 

'Coherent and noncoherent transmissions for network coding are analogous to coherent and noncoherent transmissions for multiple antenna 
channels in wireless communications. 
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needed for characterizing the capabilities of the code for error correction and for error detection. This led to the 
discovery that for a nonlinear network code, the number of correctable errors can be more than half of the number 
of detectable errors. (For classical algebraic codes, the number of correctable errors is always the largest integer 
not greater than half of the number of detectable errors.) Further, for the general case, an equivalence relation on 
weight measures was defined and it was shown that weight measures belonging to the same equivalence class lead 
to the same minimum weight decoder. In the special case of network coding, four weight measures, including the 
Hamming weight and others that have been used in various works ||T21 . ||T6l , IfTTl , were proved to be in the same 
equivalence class for linear network codes. 

Network error detection by random network coding has been studied by Ho et al. ifTSl . Jaggi et al. lfT6l . Q, 
lfT9l have developed random algorithms for network error correction with various assumptions on the adversaries. 
A part of the work by Zhang [12| considers packet network error correction when the network code is not known 
by receivers, where a sufficient condition for correct decoding was given in terms of the minimum distance. The 
distribution of the minimum distance when applying random network coding was bounded by Balli, Yan and Zhang 
II20I . They also studied decoding network error-correcting codes beyond the error correction capabiUty ||2T1 . 

Koetter and Kschischang ll22ll introduced a general framework for noncoherent network error correction. In their 
framework, messages are modulated as subspaces, so a code for noncoherent network error correction is also called 
a subspace code. They proved a Singleton bound, a sphere-packing bound and a sphere-covering bound for subspace 
codes. Using rank-metric codes, Silva and Kschischang 12311 constructed nearly optimal subspace codes and studied 
the decoding algorithms. 

B. Paper Outline 

In this paper, we follow the framework provided in ifTSll to study the design of linear network codes for coherent 
network error correction. 

The coding bounds for coherent network error correction obtained in ||6l, ifTOl . ifTTl take only one sink node 
with the smallest maximum flow from the source node into consideration. We observe that each sink node 
can be considered individually and a sink node with larger maximum flow can potentially have higher error 
correction/detection capability. These observations lead to the refined versions of the Hamming bound, the Singleton 
bound and the Gilbert- Varshamov bound for network error correction to be proved in this work. By way of the 
weight properties of network coding, the proof of these bounds are as transparent as their classical counterparts for 
linear network codes. By contrast, the proofs of the original versions of these bounds (not necessarily for linear 
network codes) in ifTOl . IfTTl are considerably more compUcated. The refined Singleton bound was also implicitly 
obtained by Zhang ifTSll independently. When applying to classical error correction, these bounds reduce to the 
classical Hamming bound, the classical Singleton bound and the classical Gilbert- Varshamov bound, respectively. 

Similar to its classical counterpart, this refined Singleton bound is tight for linear network codes. The tightness of 
this refined bound is shown by two construction algorithms of linear network codes achieving the bound. A linear 
network code consists of two parts, a codebook and a set of local encoding kernels (defined in Section [nil. Our 
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first algorithm finds a codebook based on a given set of local encoding kernels. The set of local encoding kernels 
that meets our requirement can be found by the polynomial-time algorithm in H|. The second algorithm finds a set 
of local encoding kernels based on a given classical error-correcting code satisfying a certain minimum distance 
requirement as the codebook. These two algorithms illustrate different design methods. The set of local encoding 
kernels determines the transfer matrices of the network. The first algorithm, similar to the classical algebraic coding, 
designs a codebook for the transfer matrices. The second algorithm, instead, designs transfer matrices to match a 
codebook. 

Various parts of this paper have appeared in 1241 . ||25]| . Subsequent to ||24| . based on the idea of static network 
codes lis, Matsumoto ll26ll proposed an algorithm for constructing linear network codes achieving the refined 
Singleton bound. In contrast to ours, Matsumoto's algorithm designs the codebook and the local encoding kernels 
together The complexity and field size requirements of these three algorithms are compared. 

This paper is organized as follows. In Section [III we formulate the network error correction problem and review 
some previous works. The refined coding bounds for coherent network error correction are proved in Section |in] 
In Section |IV| the tightness of the refined Singleton bound is proved, and the first construction algorithm is given. 
In Section |V] we introduce another construction algorithm that can achieve the refined Singleton bound. In the last 
section, we summarize our work and discuss future work. 

II. Network Error-Correcting Problem 

A. Problem Formulation 

Let F be a finite field with q elements. Unless otherwise specified, all the algebraic operations in this paper are 
over this field. A communication network is represented by a directed acyclic graph (DAG). (For a comprehensive 
discussion of directed acyclic graph, please refer to [21] and the references therein.) A DAG is an ordered pair 
Q = (V, £) where V is the set of nodes and £ is the set of edges. There can be multiple edges between a 
pair of nodes, each of which represents a communication link that can transmit one symbol in the finite field 
F. For an edge e from node a to b, we call a (b) the tail (head) of the edge, denoted by tail(e) (head(e)). Let 
1(a) = {e e £ : head(e) = a} and 0(a) = {e E £ : tail(e) = a} be the sets of incoming edges and outgoing edges 
of node a, respectively. 

A directed path in is a sequence of edges {ci e f : i = 1, 2, • • • , fc} such that head(ei) = tail(ei+i) for 
i — 1, 2, • • • ,k — 1. Such a directed path is also called a path from tail(ei) to head(efc). A directed acyclic graph 
gives rise to a partial order < on its nodes, where a < b when there exists a directed path from a to 6 in the DAG. 
Similarly, a DAG gives rise to a partial order < on the edges, where e < e' when e = e' or head(e) < tail(e'). 
In other word, e < e' if there exists a directed path from tail(e) to head(e') that uses both e and e'. We call this 
partial order on the edges the associated partial order on the edges. We extend the associated partial order on the 
edges to a total order on the edges such that for all e and e' in £, either e < e' or e' < e. Such an extension is not 
unique, but we fix one in our discussion and write £ = {ci : i = 1,2, ■ ■ ■ , \£\}. 
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A multicast network is an ordered triple {Q, s, T) where Q is the network, s G V is the source node and 7" C V 
is the set of sink nodes. The source node contains the messages that are demanded by all the sink nodes. Without 
loss of generality (WLOG), we assume I(s) = 0. Let Ug = |0(s)|. The source node s encodes its message into 
a row vector x = [xe,e G 0(s)] G F"% called the codeword. The set of all codewords is the codebook, denoted 
by C. Note that we do not require C to be a subspace. The source node s transmits a codeword by mapping its Ug 
components onto the edges in 0(s). For any node v ^ s with l{v) = 0, we assume that this node outputs the zero 
element of F to all its outgoing edges. 

An error vector z is an |f j-dimensional row vector over F with the ith component representing the error on 
the ith edge in £. An error pattern is a subset of £. Let be the error pattern corresponding to the non-zero 
components of error vector z. An error vector z is said to match an error pattern p if Pz C p. The set of all error 
vectors that match error pattern p is denoted by p* . Let and be the input and output of edge e, respectively, 
and let the error on the edge be Zg. The relation between F^, F^ and is given by 

Fe = Fe + Ze. (1) 

For any set of edges p, form two row vectors 

Fp = [Fe,e G p], 

and 

Fp = [F,,eep]. 

A network code on network Q is a codebook C C F"=^ and a family of local encoding functions {/3e : e G £\ 0(s)}, 
where /3e : fI ^ F, such that 

Fe = /3e(Fi(taii(e))). (2) 

Communication over the network with the network code defined above is in an upstream-to-downstream order: 
a node applies its local encoding functions only after it receives the outputs from all its incoming edges. Since the 
network is acyclic, this can be achieved in light of the partial order on the nodes. With Fo(s) — x and an error 
vector z, the symbol Fg, Ve G £, can be determined inductively by ([T]) and (|2|. When we want to indicate the 
dependence of Fg and Fg on x and z explicitly, we will write them as Fe(x, z) and Fe(x, z), respectively. 

A network code is linear if (3^ is a linear function for all e G £ \ 0(s), i.e., 

Fg — ^ ^ /3e',eFe', 
e'ef 

where /3e'.e is called the local encoding kernel from edge e' to edge e. The local encoding kernel (3e'.e can be 
non-zero only if e' G I(tail(e)). Define the \£\ x \£\ one-step transformation matrix K — [Ki,j] in network Q as 
Ki,j — Pei.cj- For an acyclic network, = for some positive integer N (see [3J and 1281 for details). Define 
the transfer matrix of the network by F = (I — K)^^ 
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For a set of edges p, define a \p\ x \£\ matrix Ap = [Ai^j] by 

{1 if e," is the ith edee in p, 
(3) 
otherwise. 

By applying the order on 8 to p, the |p| nonzero columns of Ap form an identity matrix. To simplify notation, we 
write Fp p/ = ApFAj, . For input x and error vector z, the output of the edges in p is 

Fp(x,z) = (xAo(,)+z)FA; (4) 

= xFo(.),p + zFAj. (5) 

Writing F^(x, z) — Fi(^)(x, z) for a node w, the received vector for a sink node t is 

Ft(x,z) =xF,,t + zFt, (6) 

where Ys.t — Fo(s).i(t), and Ft = FA^^^. Here F^ t and Fj are the transfer matrices for message transmission and 
error transmission, respectively. 

B. An Extension of Classical Error Correction 

In this paper, we study error correction coding over the channel given in in which Fg t and Ft are known by 
the source node s and the sink node t. The channel transformation is determined by the transfer matrices. In classical 
error correction given in Fig[Tl the transfer matrices are identity matrices. Thus, linear network error correction is 
an extension of classical error correction with general transfer matrices. Our work follows this perspective to extend 
a number of results in classical error correction to network error correction. 

Different from classical error correction, network error correction provides a new freedom for coding design — 
the local encoding kernels can be chosen under the constraint of the network topology. One of our coding algorithm 
in this paper makes use of this freedom. 

C. Existing Results 

In ifTSl , Yang et al. developed a framework for characterizing error correction/detection capabilities of linear 
network codes for coherent network error correction. They define equivalence classes of weight measures on error 
vectors. Weight measures in the same equivalence class have the same characterizations of error correction/detection 
capabilities and induce the same minimum weight decoder Four weight measures, namely the Hamming weight 
and the others that have been used in the works lfT2l . lfT6l . ifTTl . are proved to be in the same equivalence class for 
linear network codes. Henceforth, we only consider the Hamming weight on error vectors in this paper. For sink 
node t and nonnegative integer c, define 

$t(c) = {zFt izeFl^l, ii;H(z) <c}, (7) 

where WHi^) is the Hamming weight of a vector z. 
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Definition 1: Consider a linear network code with codebook C. For each sink node t, define the distance measure 

A(xi, X2) = min{c : (xi - X2)F^,f £ $t(c)} (8) 
and define the minimum distance of the codebook 

dmin.t min D4(Xi,X2). (9) 

Xi5£X2eC 

We know that Dt is a translation-invariant metric ifTSl . Consider xi, X2 G C. For any z with zF^ — (xi — X2)F5 (, 
we have (xi - X2)Fs,t £ $t(w//(z))- Thus 

A(xi,X2) < min Wh(z). 

z:(xi-X2)Fs,t=zFt 

On the other hand, we see that (xi— X2)Fs^t e $t(Z?t(xi, X2)). So, there exists z e F'^' with i/;//(z) = Z?f(xi,X2) 
and (xi - X2)Fs,t = zF*. Thus, 

A(xi,X2)> min wh{z). 

z:(xi-X2)Fs,t=zFt 

Therefore, we can equivalently write 

A(xi,X2)= min wh{z). (10) 

z:(xi-X2)Fs 4=zFt 

Definition 2: Minimum Weight Decoder / at a sink node t, denoted by MWDj , decodes a received vector y as 
follows: First, find all the solutions of the equation 

i^t(x,z)=y (11) 

with X G C and z G F'^' as variables. A pair (x, z), consisting of the message part x and the error part z, is said to 
be a solution if it satisfies (fTTT i. and (x, z) is a minimum weight solution if wh{z) achieves the minimum among 
all the solutions. If all the minimum weight solutions have the identical message parts, the decoder outputs the 
common message part as the decoded message. Otherwise, the decoder outputs a warning that errors have occurred. 

A code is c-error-correcting at sink node t if all error vectors z with wniz) < c are correctable by MWD^. 

Theorem 1 ([15]): A linear network code is c-error-correcting at sink node t if and only if (imin,t > 2c + 1. 

For two subsets Vi, V2 C F"% define 

V1+V2 = {vi + V2 : vi G Vi, V2 G V2}. 

For V G F"= and V C F"% we also write {v} + V as v + V. For sink node t and nonnegative integer c, define the 
decoding sphere of a codeword x as 

$t(x,c) = {^t(x,z) : z G ¥\^^,wh{z) < c} 

= xF,,t+$t(c) (12) 

Definition 3: If $t(x, c) for all x G C are nonempty and disjoint. Minimum Weight Decoder II at sink node t, 
denoted by MWDj^(c), decodes a received vector y as follows: If y G $f(x, c) for some x G C, the decoder 
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outputs X as the decoded message. If y is not in any of the decoding spheres, the decoder outputs a warning that 
errors have occurred. 

A code is c-error-detecting at sink node t if MWD[^(0) exists and all error vector z with < wh{z) < c are 
detectable by MWD^'^(O). 

Theorem 2 ([15]): A code is c-error-detecting at sink node t if and only if dmin,t > c + 1. 

Furthermore, we can use MWDj^(c), c > 0, for joint error correction and detection. Erasure correction is 
error correction with the potential positions of the errors in the network known by the decoder. We can similarly 
characterize the erasure correction capability of linear network codes by dn^i^.t- Readers are referred to ifTsl for the 
details. 

There exist coding bounds on network codes that corresponding to the classical Hamming bound. Singleton 
bound and Gilbert- Varshamov bound. We review some of the results in ifTOl . lITTI . The maximum flow from node 
a to node b is the maximum number of edge-disjoint paths from a to b, denoted by maxflow(a, 6). Let 



and 



C^min — niindmint, 

ter 



n = minmaxflow(s, <). 



In terms of the notion of minimum distance, the Hamming bound and the Singleton bound for network codes 
obtained in ifTol can be restated as 

where t = [ '^'"'^^"^ J, and 

Id < (14) 
respectively, where q is the field size. The tightness of (fl4] i has been proved in ifTTl . 

III. Refined Coding Bounds 

In this section, we present refined versions of the coding bounds in ifTOl . ifTTl for linear network codes. In terms 
of the distance measures developed in |15|, the proofs of these bounds are as transparent as the their classical 
counterparts. 

A. Hamming Bound and Singleton Bound 

Theorem 3: Consider a linear network code with codebook C, rank(rs j) = and dmin.t > 0. Then \C\ satisfies 

1) the refined Hamming bound 

'^'""^ Ea„(7)(.-i)- ' 

where Tt — [ J, and 

2) the refined Singleton bound 

[C[ < qn-d„.„,t+i^ (16) 
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for all sink nodes t. 
Remark: The refined Singleton bound can be rewritten as 

dmin,t <rt~ logq |C| + 1 < niaxflow(s, t) - \ogg \C\ + 1, 

for all sink nodes t, which suggests that the sink nodes with larger maximum flow values can potentially have 
higher error correction capabilities. We present network codes that achieve this bound in Section [TVl and [Vl 

Proof: Fix a sink node t. Since rank(Fs j) = rt, we can find rt linearly independent rows of Fs.t- Let 
Pt C 0(s) such that \pt\ — rt and Fp^K^t) is a full rank submatrix of Fg.t- Note that pt can be regarded as an error 
pattern. Define a mapping ipt '■ C ^ ¥'''* by (j)t{x) = x' if x'Fp^ — xFs,*. Since the rows of Fp^K^t) form a 
basis for the row space of Fs,t, 4>t is well defined. The mapping (f)t is one-to-one because otherwise there exists 
x' G F*"' such that x'F^^ = xiF^.f — X2Fs.f for distinct Xi,X2 G C, a contradiction to the assumption that 
dmin,t > 0. Define 

Ct = {0t(x) : X e C}. 

Since (pt is a one-to-one mapping, \Ct \ = \C\. 

We claim that, as a classical error-correcting code of length rt, Ct has minimum distance (imin(Ct) > (imin,t- We 
prove this claim by contradiction. If (imin(Ct) < dmin,*, it means there exist x']^,X2 G Ct such that wnix-'i — x'a) < 
(iniin,t- Let xi = (fi^^i^'i) and X2 = ^("^(xj). We know that xi,X2 G C, and 

(xi - X2)F,,t = (x'l - x^)Fp^,i(t) = z'Ft, 

where z' = (x'j^ — X2)Ap^. Thus, 

A(X1,X2) < Wh{2.') = Wh{x.[ - X2) < dmin.t, 

where the first inequality follows from ( fTol i. So we have a contradiction to dmin,t < £'t(x;i, X2) and hence (imin(Ct) > 
c^min,* as claimed. Applying the Hamming bound and the Singleton bound for classical error-correcting codes to 
Ct, we have 

where r/ = [i^Es^ihiij > ^-j, and 

\Ct\ < < g'-t-rfmin,,+i^ 

The proof is completed by noting that |C| = |Cf |. ■ 
Remark: Let / be an upper bound on the size of a classical block code in terms of its minimum distance such 
that / is monotonically decreasing. Examples of / are the Hamming bound and the Singleton bound. Applying 
this bound to Ct, we have 

\Ct\ < /«i„(Ct)). 
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Since / is monotonically decreasing, together with rfmin(Ct) > dmin.t as shown in the above proof, we have 

\C\ = \Ct\< fid^iniCt)) < /(dmin.t). (17) 

In other words, the bounds in ( fTTI ) is simply the upper bound / applied to C as if C is a classical block code with 
minimum distance dmin,*. 
Lemma 4: 

q-m qm+l 



for T < to/2. 

Proof: This inequality can be established by considering 



q-m qTn+l 



ELo (T) i<i-^y ELo rv) ('^ - 1) 



+1 

qm+l 

^ ELorr)('?-ir ^''^ 

where JTST l holds because '^^'^"^i""'^"' > 1 given that g > 2 and i < t < m/2. ■ 
The refined Hamming bound and the refined Singleton bound, as we will show, imply the bounds shown in ( fT3] ) 
and (fT4l) but not vice versa. The refined Hamming bound implies 

|C|< ^ 



EIlo (7)(9~iy 
- ELo(?)(?"i)^ ^''^ 

„maxfiow(s,i) 

< ^ a . (20) 

" ELo (--«7(^'*))(g-l)' 

for all sink nodes t, where fT% follows from r = [ '^'"™~"'" J < [ '^'"'"^*~"'" J — Tt, and (|20] i follows from rt < 
m£ixflow(s, t) and the inequality proved in Lemma |4] By the same inequality, upon minimizing over all sink 
nodes t G T, we obtain (fTsT l. Toward verifying the condition for applying the inequality in Lemma |4] in the above, 

we see rt > (imin,t ~ 1 since 1 < |C| < q''t^'^ni".*+i. Then 

, , C^min,* ~ 1 ^ ''t 
T < Tt < ■ < — 

- * - 2 - 2 

for all t e r. 

The refined Singleton bound is maximized when rt = maxflow(s,t) for all t £ T- This can be achieved by a 
linear broadcast code whose existence was proved in ||2|, llT4l . To show that the refined Singleton bound implies 
(O, consider 

^ ^maxflow(s,t) — cZmin+l 

for all sink nodes t. Then (fT4l) is obtained upon minimizing over all t £ T- 
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B. Sphere-Packing Bound 

For nonnegative integer d, define 

At(x,d) = {x' e : A(x',x) < d}. (21) 

Here Dt{-, •) is defined in (|8]l. Since Dt is a translation invariant metric ifTSll . we have At(x, d) = x + Aj(0,d), 
which implies \At{x,d)\ = |At(0,(i)|. Another fact is that At{0,d) is closed under scalar multiplication, i.e., 

aAt{0,d) ^ {ax : X e At(0,d)} = At(d), 



where a G F and a 7^ 0. 
Lemma 5: 



^J']q'> |A,(0,d)|q-("-^') = > J2 f? ) (9 - (22) 



i=0 ^ 

where rt — rank(Fs t) and d < rt- 

Proof: Applying the definition of Dt, Af(0,(i) can be rewritten as 

At(0, d) = {x e : xF,,, e (23) 

where $t is defined in (Q. Since the rank of < is rt, the null space of Fj t defined as 

Null(F,,0 = {x : xF,,t = 0} 

has dimension ng—rt- By the theory of linear system of equations, for each vector y in ^t{d), we have |Null(Fs,t)| = 
q^B-rt vector x satisfies xF^ f = y, and all such x are in A((0, d). Thus, 

|Ai(0,d)| (24) 

By the definition of $t, we have 

|$t(d)| < |{z e FI^I : wh{2) < d}\ 

< (25) 



Together with ( |24] |. we obtain the first inequality in ( |22] |. 

Since rank(Fs,t) — rt, we can find rt linearly independent rows of Fg.t- Let pt C 0(s) such that — rt and 
Fpt,i(t) is a full row rank submatrix of Fs 4. Note that Fp^ is also a submatrix of Ft- Since, 

= {zFt : wh{z) <d}D {z'Fp^,i(() : wh{z') < d}, 

we have 

\Md)\>\{z'Fp^j^t):w„{z')<d}\ 

= |{z'eF'-' ■.WH{z')<d}\ 
d , 

r 



i=0 
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The proof is complete. ■ 
Using the idea of sphere packing, we have the following stronger version of the refined Hamming bound in 
Theorem [3] 

Theorem 6 (Sphere -packing bound): A linear network code with codebook C and positive minimum distance 
rfmin.t for all sink nodes t satisfies 



\c\< ' 



Proof: For different codewords Xi and X2, we show that Af (xi, Tf) and At(x2, Tf) are disjoint by contradiction. 



where n = [^sii^^J 



Let 

X e At(xi, n) nAf(x2, Tt). 

By the definition of Af in ( 1211 1. we have Z?f(xi,x) < Tf and _Di(x2,x) < Tf. Applying the triangle inequality of 
Dt, we have 

A(X1,X2) < A(X1,X) + A(X2,X) 
< 2Tt 

^ '^miii,i li 

which is a contradiction to the definition of dmin.t- Therefore, q^" > X^xgc '''01 = The proof 

is complete by considering the equality in Lemma |5] ■ 
Applying the second inequality in Lemma|5] Theorem|6]implies the refined Hamming bound in Theorem[3] Thus 
Theorem |6] gives a potentially tighter upper bound on \C\ than the refined Hamming bound, although the former is 
less explicit than the latter. 

C. Gilbert Bound and Varshamov Bound 

We have the following sphere-covering type bounds for linear network codes. 

Theorem 7 (Gilbert bound): Given a set of local encoding kernels, let |C|max be the maximum possible size of 
codebooks such that the network code has positive minimum distance c?min,t for each sink node t. Then, 

ICUax > (26) 

where 

A(0) = U(erAt(0, d^in,t - 1). (27) 
Proof: Let C be a codebook with the maximum possible size, and let 

A(c) = UterAt(c,(i„,in,t - 1). 
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For any x e F"% there exists a codeword c e C and a sink node t such that 

A(X, C) < dinin,t - 1, 

since otherwise we could add x to the codebook while keeping the minimum distance. By definition, we know 

A(c) = Uter{x e F"= : A(x,c) < - 1}. 

Hence, the whole space is contained in the union of A(c) over all codewords c G C, i.e., 

=UcecA(c). 

Since A(c) = c + A(0), we have |A(c)| = |A(0)|. So we deduce that < |C|1A(0)|. ■ 
We say a codebook is linear if it is a vector space. 

Lemma 8: Consider a linear network code with linear codebook C. The minimum distance (imin,* > if and 
only if 

cnAt(o,d-i) = {o}. 

Proof: If there exists x G C fl At(0, d — 1) and x 7^ 0, then Dt{Q^ x) < d. Since G C, we have dmin.t < d- 
This proves the sufficient condition. 

Now we prove the necessary condition. For xi,X2 G C, xi — X2 G C. Since 

A(xi,X2) = A(xi -X2,0), 

we have 

dmin,t = min A(x, 0). 
xec,x5^o 

Thus, 

cnAt(o,d„,i„,t-i) = {o}. 

The proof is completed noting that Af (0, dyain.t — 1) D Af (0, d — 1). ■ 

Theorem 9 (Varshamov bound): Given a set of local encoding kernels, let Wmax be the maximum possible 
dimension of linear codebooks such that the network code has positive minimum distance dmin,* for each sink 
node t. Then, 

wmax > -logJA(0)|, (28) 

where A(0) is defined in (|27] |. 

Proof: Let C be a linear codebook with the maximum possible dimension. By Lemma[8j C fl A(0) = {0}. We 
claim that 

F"-' = A(0) + C. (29) 

If the claim is true, then 

= |A(0) +C| < |A(0)||C| = lA(0)lg'^— , 
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proving 

Since F"= D A(0) + C, so we only need to show F"= C A(0) + C. Assume there exists 

geF"- \(A(0)+C). (30) 

Let C = C + (g). Then C is a subspace with dimension Wmax + 1. If C H A(0) ^ {0}, then there exists a non-zero 
vector 

c + agGA(0), (31) 

where c e C and a £ F. Here, a 7^ 0, otherwise we have c = because C n A(0) = {0}. Since At(0, dmin.t — 1) 
is closed under scalar multiplication for all t E T, see from dZTl l that the same holds for A(0). Thus from (l3Tl i. 

g G A(0) - a-^c C A(0) +C, 

which is a contradiction to ( l30b . Therefore, C fl A(0) = {0}. By Lemma|8] C is a codebook such that the network 
code has unicast minimum distance larger than or equal to dmin.t, which is a contradiction on the maximality of 
C. The proof is completed. ■ 

IV. Tightness of the Singleton Bound and Code Construction 

For an (w, (rt : t E T),{dt : t E T)) linear network code, we refer to one for which the codebook C is an 
cj-dimensional subspace of F"% the rank of the transfer matrix Yg.t is rt, and the minimum distance for sink node 
t is at least c?(, t e T. In this section, we propose an algorithm to construct (w, (rt : t E T),(dt '■ t E T)) linear 
network codes that can achieve the refined Singleton bound. 

A. Tightness of the Singleton Bound 

Theorem 10: Given a set of local encoding kernels with rt = rank(rs t) over a finite field with size q, for every 

< Lu < minrt, (32) 

there exists a codebook C with |C| such that 

dmin,t =rt + 1 (33) 

for all sink nodes t, provided that q is sufficiently large. 

Proof: Fix an u which satisfies (|32] |. We will construct an w-dimensional linear codebook which together with 
the given set of local encoding kernels constitutes a Unear network code that satisfies (|33T l for all t. Note that (|32] l 
and ( [33I 1 imply 

dmin,t > 1- 

We construct the codebook C by finding a basis. Let gi, • ■ • , gc^ E F"= be a sequence of vectors obtained as 
follows. For each i, 1 < i < oj, choose g^ such that 

g, ^ At(0,rt-cj) + (gi,--- ,g,_i) (34) 
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for each sink node t. As we will show, this implies 

At(0,rt-w)n(gi,--- ,g,) = {0} (35) 

for each sink node t. If such gi, • • • ,gc^ exist, then we claim that C = (gi, • • • is a codebook with the 

desired properties. To verify this claim, first, we see that gi, • ■ • , gc^ are linearly independent since ( l34l i holds for 
i = 1, • ■ • , w; second, we have dmin.t > rt — oj + 1 since ( [35] ) holds for i = oj (ref Lemma [8]). Note that by ( fT6l ). 
the refined Singleton bound, we indeed have dmin.t = f t — + 1, namely ( [33] l for any sink node t. 
Now we show that gj satisfying ( l34b exists if the field size q is sufficiently large. Observe that 

|At(0,rt-w) + (gi,--- ,g,_i)| 
< |At(0,rt-c.)|g-i 



rt - w 
1^1 

Vt — UJ 



H 5 



where < l36] t follows from Lemma [51 If 



'?"^>E( (37) 



ter 



we have 

F"= \ Ut(At(0, rt - + (gi, • • • , g,_i)) ^ 0, 
i.e., there exists a satisfying ( l34b . Therefore, if g satisfies dJTl ) for all i = 1, • • ■ ,a;, or equivalently 

^—L \rt — u I 

then there exists a vector that can be chosen as gi for i = 1, • • • , cj. 

Fix gi, • • • , gi that satisfy ( l34l i. We now prove by induction that ( |35] ) holds for gi, • • • , gi. If ( l35] ) does not hold 
for z = 1, then there exists a non-zero vector agi G At(0,rt — uj), where a G F. Since At(0,rt — lS) is closed 
under scalar multiplication and a 7^ 0, we have gi e At(0, rj — w), a contradiction to ( [34] i for i = 1. Assume 

holds for i < /c — 1. If (|35]) does not hold for i ^ k, then there exists a non-zero vector 

fc 

^a^g^ e A((0, rt - w), 

where ai £ F. If ak — 0, 

fc-i 



^a^gi G At(0,rt -w), 



1=1 



a contradiction to the assumption that (|35] | holds for i = fc — 1. Thus ak ^ 0. Again, by At(0, rt — oj) being closed 
under scalar multiplication, we have 

fc-i 

gk e At(0,rt ~uj) -a^^^Qfigi 

i=l 

C At(0,rt-w) + (gi,--- ,gfc-i), 
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a contradiction to satisfying (|34] l. The proof is completed. ■ 

B. The First Construction Algorithm 

The proof of Theorem [TOl gives a construction algorithm for an (w, {rt : t ^ T), {dt : t E T)) linear network 
code and it also verifies the correctness of the algorithm when the field size is sufficiently large. This algorithm, 
called Algorithm 1, makes use of existing algorithms (e.g., the Jaggi-Sanders algorithm ID) to construct the local 
encoding kernels. The pseudo code of Algorithm 1 is shown below. 

Algorithm 1: Construct network codes that achieve the refined Singleton bound, 
input : (g, s, T), (n : t e T), lo, {dt : t e T) with rj < maxflow(s, t) yt e T 

output: local encoding kernels and C 

1 begin 

2 Construct a set of local encoding kernels such that rank(Fs,t) = r^; 

3 for i l,uj do 

4 find g, such that g^ ^ Ut At(0, - 1) + (gi, • • • , g^_i) ; 

5 end 

6 end 



The analysis of the complexity of the algorithm requires the following lemma implied by Lemma 5 and 8 in ||4|. 

Lemma 11: Suppose m < q, the field size, and Bk C F", fc = 1, • • • , m, are subspaces with dim(i3fc) < n. A 
vector u G F" \ U™^^Bk can be found in time 0{n?m + nm?). 

Proof: For each Bk find a vector a/,. £ F" such that a^b^ = 0, Vb G Bk- This vector can be obtained by 
solving the system of linear equations 

Bfea^ = 0, 

where is formed by juxtaposing a set of vectors that form a basis of Bk- The complexity of solving this system 
of linear equations is 0{rr')- 

We inductively construct ui, U2, • • • , Um such that UjajT 7^ for all 1 < fc < i < to. If such a construction is 
feasible, then u,„ ^ Bk, Vfc < m. Thus, u = u„j ^ UjILj^Sfc is the desired vector 

Let Ui be any vector such that Uiaj'^ ^ 0. For 1 < i < 771 — 1, if u-iaj^^ ^ 0, we set u^+i = u^. Otherwise, 
find bi+i such that b^+ia^j^ ^ 0. We choose 

a e F \ {-(b,+ia7)/(u,a7) : 1 < i < i}, (39) 

and define 

Uj+i = axil + bi+i. 
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The existence of such an a follows from q > m > i. 
By construction, we know that 

If Ui+iaJ — au^aj +bi+iaj = for some 1 < j < i, we have a — — (bi+iaj)/(uiaj), a contradiction to (l39l) . 
So, Ui_|_iaJ ^ for all j such that 1 < j < « + 1. 

Similar to the analysis in Lemma 8], the construction of u takes time 0{nm'^). Therefore, the overall time 
complexity is 0{n?m + nm?). ■ 

We analyze the time complexity of Algorithm 1 for the representative special case that rt ~ r and dt — d for all 
t ^ T, where r < minteT niaxflow(s, t) and d < r — oj + 1. In the pseudo code. Line 2 can be realized using the 
Jaggi-Sanders algorithm with complexity | |T|?i(n + |T|)), where n = mint^T" maxflow(s, Q. Line 3-5 is 
a loop that runs Line 4 uj times. Considering At(0, d — 1) as the union of {^^\) subspaces of F*", Line 4 can be 
realized in time 0{nl\T\{^^\) + (jf-^))^) as proved in Lemma [TT] Repeating oj times, the complexity of 

Line 3-5 is 

O{u;nl\T\i + 0Jns\T\'e), 

where ^ — (J^'j^)- The overall complexity is 

0{u;n^\r\anl + \m + \£\\rHn + \T\))). 

Comparing the complexities of constructing the local encoding kernels (Line 2) and finding the codebook (Line 
3-5), the latter term in the above dominates when d > 1. 

To guarantee the existence of the code, we require the field size to be sufficiently large. From ( |38] ) in the proof 
of Theorem [lOl all finite fields with size larger than are sufficient. It is straightforward to show that this 

algorithm can also be reahzed randomly with high success probability if the field size is much larger than necessary. 

V. The Second Construction Algorithm 

Algorithm 1 can be regarded as finding a codebook for the given transfer matrices. In this section, we study 
network error correction from a different perspective by showing that we can also shape the transfer matrices 
by designing proper local encoding kernels. Following this idea, we give another algorithm that constructs an 
(w, (rt : t ^T),{dt : t hnear network code. 

A. Outline of Algorithm 2 

We first give an informal description of this algorithm. The second algorithm, called Algorithm 2, starts with a 
classical error-correcting code as the codebook. The main task of the algorithm is to design a set of local encoding 
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kernels such that the minimum distances of the network code, roughly speaking, are the same as the classical 
error-correcting code. 

It is complicated to design all the local encoding kernels altogether Instead, we use an inductive method: we 
begin with the simplest network that the source and the sink nodes are directed connected with parallel edges; 
we then extend the network by one edge in each iteration until the network becomes the one we want. For each 
iteration, we only need to choose the local encoding kernels associated with the new edge. 

We have two major issues to solve in the above method: the first is how to extend the network; the second is 
how to choose the local encoding kernels. In Section IV-BI we define a sequence of networks CJ* for a given network 
Q. The first network is the simplest one as we described, the last one is the network Q, and ^7*+^ has one more 
edge than CJ*. In Section fV-CI we give an algorithm that designs the local encoding kernels inductively. Initially, 
we choose a classical error-correcting code that satisfies certain minimum distance constraint. The local encoding 
kernels of is determined as follows: Except for the new edge, all the local encoding kernels in ^*+^ are 
inherited from Q^. The new local encoding kernels (associated with the new edge) is chosen to guarantee 1) the 
preservation of the minimum distance of the network code, and 2) the existence of the local encoding kernels to 
be chosen in the next iteration. We find a feasible condition on the new local encoding kernels to be chosen such 
that these criteria are satisfied. 

When dt — 1 for all sink nodes t, this algorithm degenerates to the Jaggi-Sanders algorithm for designing linear 
network codes for the error-free case. 

B. Iterative Formulation of Network Coding 

In this and the next subsections, we describe the algorithm formally. At the beginning, the algorithm finds rt 
edge-disjoint paths from the source node s to each sink node t using a maximum flow algorithm (for example, 
finding the augmenting paths). We assume that every edge in the network is on at least one of the X^ter'^* paths 
we have found. Otherwise, we delete the edges and the nodes that are not on any such path, and consider the 
coding problem for the new network. Note that a network code for the new network can be extended to the original 
network without changing the minimum distances by assigning zero to all the local encoding kernels associated 
with the deleted edges. 

We consider a special order on the set of edges such that 1) it is consistent with the partial order on the set of 
edges; 2) the first edges are in 0(s). The order on the paths to a particular sink node is determined by the first 
edges on the paths. 

Given a DAG Q, we construct a sequence of graphs Q"^ = (V%£'),i = 0,1, ■ ■ ■ ,\£\ ~ Ug as follows. First, 
Q'^ consists of a subgraph of Q containing only the edges in 0(s) (and the associated nodes) and all the sink 
nodes. Following the order on £, in the ith iteration C/'^^ is expanded into by appending the next edge (and 
the associated node) in £. This procedure is repeated until C/' eventually becomes Q. Note that contains ng + i 
edges and (y'^'^"^ ~ Q. A sink node t has rt incoming edges in Q^, where the jth edge is the most downstream 
edge in the truncation in of the jth edge-disjoint path from the source node s to sink node t in Q. With a slight 
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abuse of notation, we denote the set of incoming edges of a sink node t in Q"^ as I(t), when is imphed by the 
context. Fig. |2] illustrates and when Q is the butterfly network. 

The network is a multicast network with the source node s and the set of sinks T- The algorithm chooses 
a proper codebook, and then constructs local encoding kernels starting with Q''^ Except for the new edge, all the 
local encoding kernels in Q"^^^ are inherited from Q'' . We define K% r\ FJ, z* and for in view of K, F, 
Fp, z and defined for Q in Section [III respectively. Writing = -^/(t)' have 

F^(x, z') = (xAj,(,) + z'')F''(Ai'(,))T, (40) 

in view of (|4|. Further, we can define the minimum distance 0?*^^;^^ ^ corresponding to the sink node t at the ith 
iteration as in (|9|l. 

Consider a matrix M. Let (M)^ be the submatrix of M containing the columns with indices in C, and M^^ 
be the submatrix obtained by deleting the columns of M with indices in C. If C = {j}, we also write M^^ and 
{M)j for M^^-?'} and (M){j}, respectively. 

In the following, we give an iterative formulation of F^ for i > Q. Let e be the edge added to Q^^^ to form Q"^, 
and let kg = [/3e'.e ■ e' G £i-i] be an (n^ + i — l)-dimensional column vector. In the ith iteration, we need to 
determine the component /Jg'.e of kg with e' G I(tail(e)). All other components of kg are zero. Using kg, we have 

= (I-K')"^ 

r 1 ^1 

I 1 kg 

1 

(I-K^-i)-i (I-K'-i)-ikg 
1 



K'-i kg 
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1 

The matrix Aq, n has one more column with zero components than AqT\, i.e., 



A* — 



(41) 



(42) 



If the edge e is not on any path from the source node s to sink node t, we only need to append a column with 



zero components to A^^^ to form A^^^, i.e., 



(43) 



For this case, we can readily obtain from (|40] |. (HTt . (l42T i and (l43T l that 

F/(x,z^) = (xA^(,)+z^)r(Aj(,))^ 

■pi-i 







= (xA-]) + (z^)\^)r-i(A-i)^ 

= Fri(x,(z')\'). (44) 

Note that (z*)\* is an (rig + i — 1) -dimensional error vector obtained by deleting the ith component of z*, which 
corresponds to e. 

If edge e is on the jth edge-disjoint path from the source node s to sink node t, to form AJ^^^, we need to first 
append a column with zero components to A^^-j^, and then move the '1' in the jth row to the last component of 



that row. That is, if 



^l(t) - 



bi 
b2 



then 



bi 

b,-i 

1 

bj+i 



br, 0. 



(45) 
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We can then obtain Fj*(x, z*) from (gOll, (|4T]i, (gUl and (|45]l as 

(F,Xx,z*)), = (xA^(,) + z*)F*((Aj(,))T), (46) 

= (xAj,(,) +z') 

= (xAj,-^!) + (z*)\*)F*-ik, + {z%, 

and 

(F,Xx,z^))\^- 

= (xA^(3)+z^)r((Aj(,))^)\^- 
= (xA^(.) + z*) 

= (F,Xx, (z^)V))\^ (47) 

C. Algorithm 2 

Let e be the edge appended to the graph in the ith iteration for i > 0. We choose kg such that the following 
feasible condition is satisfied: 

(J^'(x,-z^))\^^0 (48) 

for all combinations of 

CI) ter, 

C2) C C {l,2,...,rt} with < \C\ < dt - 1, 
C3) non-zero x G C, and 

C4) error vector z* with wh{z^) < dt — 1 — \C\. 

If the feasible condition is satisfied for sink node t and £ = 0, we have 

xA^(,)F^(Af(,))T^z-iFj, 

for all z* and x satisfying C[3]and ClU If C is a subspace, we have dl^^^ t — '^t- Since the feasible condition is required 
for each iteration, when the algorithm terminates, the code constructed for Q satisfies (imin,t > dt. Algorithm 2 is 
also called the distance preserving algorithm since the algorithm keeps the minimum distance larger than or equal 
to dt in each iteration. Even though the feasible condition is stronger than necessary for rfjj^jjj ^ > dt , i e T, as we 
will see, it is required for the existence of the local encoding kernels for k > i such that the feasible condition is 
satisfied. 

Theorem 12: Given a linear codebook with d^^jj^ j > dt for all t E T, there exist local encoding kernels such 
that the feasible condition is satisfied for i — I, - ■ ■ ,\£\ — Ug when the field size is larger than X^teT C*Vt-i^)- 



F*-lke 
1 



((Alm)^)\^' 
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Algorithm 2: Construct (i^, : t ^ T),{dt : t ^ T)) linear network code 
input : (g, s, T), (n : t e T), cj, {dt.t e 71 

output: local encoding kernels and codebook C 

1 begin 

for each sink node t do 

choose rt edge disjoint paths from s to t; 
initialize A^); 

end 

Find a linear codebook C with d^jj^ t — d-t, "^t G T; 
F ^ I, Ao(.) ^ I; 

for each e e f \ 0(s) /n an upstream to downstream order do 
r ^ 0; 

for each sink node t do 

if no chosen path from s to t crosses e then 

Ai(t) ^ ^Ai(t) 
else e is on the jih path from s to i 

for each C with \C\ < dt — I and j ^ C Ao 
for each p with \p\ — dt — 1 ~ \C\ do 

find xo 7^ and zo matching p such that (Ft(xo, -zo))^*^^'^-'^^ = 0; 
if exist xo and Zq then 

r ^ r U {k: (xqA - zo)Fk = 0}; 

end 

end 

end 

end 

update Ai(j) using 



26 
27 

28 end 



end 



choose a vector kg in y P; 



F Fke 

1 



end 
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Proof Outline: (See the complete proof in Section [V-EI ) The linear codebook satisfies the feasible condition 
for 1 = 0. Assume we can find local encoding kernels such that the feasible condition is satisfied for i < k, where 
< fc — 1 < |£| — Us- In the fcth iteration, let e be the edge appended to Q'^^^ to form Q'^. We find that kg only 
affects ( |48] | for the case such that 

1) e is on jth path from s to t, 

2) j ^ £, and 

3) {Fj'^^ix, -z))\^u{i} ^ ^jjgj-g x^OeCze F^^+''-\ wh{z) ^dt-l- \jC\. 
For t, C, X and z satisfying the above condition, we need to choose kg such that 

{xA^ - z)F'^-ike ^ 0. (49) 

We verify that if q > X^ter ''^'^ always find such a kg. ■ 

Refer to the pseudo code of Algorithm 2 above. At the beginning, the algorithm finds rt edge-disjoint paths from 
the source node to each sink node t, and initializes F, Ao(s), and Am^,t G T by F°, Aq^^^, and A^jj,t e T, 
respectively. The algorithm takes as the input a linear codebook C such that d^jj^ ^ > dt for all sink nodes t. Such 
a codebook can be efficiently constructed by using Reed-Solomon codes. The main part of this algorithm is a loop 
starting at Line 7 for updating the local encoding kernels for the edges in f \ 0(s) in an upstream-to-downstream 
order The choosing of kg is realized by the pseudo codes between Line 8 and Line 25. 

We analyze the time complexity of the algorithm for the representative special case that rt — r and dt = d for 
all t ^ T, where r < miufgT- maxflow(s, t) and d < r — uj + 1. For Line 3, the augmenting paths for all the sinks 
can be found in time OdTHf |r) dU. Line 16 and 18 can be realized by solving a system of linear equations which 
take time 0{r^) and 0(1), respectively, and each of these two lines is repeated 0{d\£\\T\{^^j)) times. Line 26 
can be solved by the method in Lemma [TTI in time C(^|T| (^^J^'j^^) ((5^ + |7~|('^^Jf]^^))), where S is the maximum 
incoming degree of Q, and this line is repeated 0{\£\) times. Under the assumption that each edge is on some 
chosen path from the source to the sinks, S < r\T\- Summing up all the parts, we obtain the complexity 

0{6\£\\T\e{S' + \T\a + r'd\£\\r\0, (50) 

where e'^r/l-^). 

Subsequent to a conference paper of this work 1241 . Matsumoto 1261 proposed an algorithm to construct network 
codes that achieve the refined Singleton bound. In Table H] we compare the performances of Algorithm 1, Algorithm 
2 and Matsumoto's algorithm. When Ug, lu, S, d and r are fixed (i.e., we regard \T\ and £ as variables) and d > 1, 
the complexities of these algorithms are 0{\r\^\£\^'^-^), 0{\r\^\£\'^'^-^) and 0(|rp|f P'^"^), respectively. 

D. An Example of Algorithm 2 

We give an example of applying Algorithm 2 to the network {Q,s,{t,u}) shown in Fig. [3] In this network 
the maximum flow to each sink node is three. We show how Algorithm 2 outputs a network code with w = 1, 
n = ru = 3 and dmin,t = d^nm^u = 3. Here the finite field F = G¥{2'^) = {0, 1, a, a^), where + a + 1 = 0. 
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TABLE I 

Comparison of deterministic construction algorithms of network error-correcting codes . ^ = ( j) 





field size 


Time complexity 




Algorithm [T] 






\r\0 + \£\\T\n{n+\T\))) 


Algorithm |2] 


irir 


Oi5\£\\T\e{S^ + 




(26] Fig. 2] 


iri€ 




f r + d)) 



The order on the set of edges is labelled in Fig. |3] and we also refer to an edge by its order From s to 
each sink node, there are three edge-disjoint paths. We fix a particular path from s to t given by the sequence 
of edges 3, 6, 8 and a path from s to m given by the sequence of edges 3, 7, 9. The other edge-disjoint paths 
can be uniquely determined. We can check that each edge is on at least one path. As we have described, define 

= {{s, a, d, e}, {1, 2, 3}), = ({s, a, d, e}, {1, 2, 3, 4}) and so on. 

We choose the codebook C — ((1, a, a^)), which is a Reed-Solomon code. Let x = (1, a, a^). Note that we only 
need to check x with the feasible condition. The reason is that the constraint to choose kg in ( |49] l is unchanged 
by multiplying a nonzero elements in F (see also Section IV-Eb . 

Notice that nodes b, c, d and e have only one incoming edges. We assume WLOG that the nodes b, c, d and 
e only copy and forward their received symbols. We refer the reader to [29 , Section 17.2] for an explanation that 
this assumption does not change the optimality of our coding design. 

In the following, we show that Algorithm 2 can give /Ja.g — /S^^q = [S^j = fi^j = 1 and /?5,6 = f34j = 0. 
Together with the local encoding kernels associated with nodes 6, c, d and e, we have a set of local encoding 
kernels satisfying the minimum distance constraints. 

We skip the first two iterations, in which we assign /3i 4 = 1 and /?2.5 = 1- In the third iteration, edge 6 is added 
to the graph and we need to determine 

n T 



/33,6 134,6 Pi 



5,6 

We have 

10 10 
10 1 
= 1 
1 
1 

We first consider node t. We see that edge 6 is on the third path to t. In this iteration, \{t) — {1, 5, 6}. We consider 
the following four cases of C such that 3 ^ £: 

1) £ = 0: Since zi = (1,0, 0,0, a) satisfies (Fj^(x, — zi))\'^ — 0, we need to choose kg such that (xAq^^^^ — 



January 22, 2010 



DRAFT 



25 




Zi)F^k6 ^ 0. This gives 



/33,6 + 0. 



(51) 



We also have Z2 = (1, a, 0, 0, 0) satisfies {F^iy^, — Z2))\'^ = 0. This error vector imposes the same constraint 
that ;33,6 ^ 0. 

2) £ = {1}: Since Z3 = (0,0,0,0,0;) satisfies (F(^(x, -Zs))^^^'^^ = 0, we need to choose kg such that 



(xA2 - Z3)F2k6 + 0. This gives 

/33,6a' + /34,6 ^ 0. 

3) £ = {2}: Similar to the above case, we have 

^3,6^2 + /Sg^ga ^ 0. 

4) £ = {1, 2}: We need xA^ ^^^F^kg 7^ 0, i.e., 

/?3,6a^ + /34,g + /?5,6a ^ 0. 
Similarly, we can analyze sink node u and obtain the following constraints on kg: 

/34,6 + 

/33,6a' + /34,6 ^ 

^4,6 + /35,6a 7^ 
^ZfiO? + ;34,6 + /35,6a + 0. 

Form ( BTI ) to ( ISST l, we have six distinct constraints, which are satisfied by /?3 g = /?4_g = 1 and /^s.g = 0. 
Then we go to the fourth iteration, for which edge 7 is added to the graph and we need to determine 



(52) 

(53) 

(54) 

(55) 
(56) 
(57) 
(58) 



k7 = 



/33,7 /34,7 /35,7 
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We have 

10 10 1 

10 10 

1 1 

1 1 

1 

1 

Edge 7 is on the second path to t. Considering all C such that 2 ^ £, we obtain the following constraints on 



(59) 
(60) 
(61) 
(62) 
(63) 
(64) 

(65) 
(66) 
(67) 
(68) 

From i59[ to ( |68T l, we have seven distinct constraints on k7, which are satisfied by (S^j — (3^j — 1 and /S^j = 0. 

Let us see what would happen if we only consider £ = 0. For this case, in iteration 3, we have only two 
constraints given by (ISTl i and dSSl l. which are satisfied by /Js.g ~ Pis = 1 and (3^^ ~ a. We see that these values 
do not satisfiy ( |53] l. We now show that it is impossible to find a network code with dmin f = 3 with these values. 
Construct an error vector z as follows: zi — \,z-j — — (/Ss^ya^ + /Ssja) and = for i ^ 1, 7. We check that 

f^t(x,z) =xF,,t+zFt 



/35,7 ^ 
fizjo? + ;35,7a 7^ 

/53,7 + /34.7 + k.lOi ^ 

P-i^io? + fi^jo? + P^.'ja ^ 
fhja^ + ;35.7a 7^ 
Psja^ + /34,7 + k^ja ^ 0. 
Similarly, we can analyze sink node u and obtain the following constraints on ky: 

/33,7 + /34,7 / 

/33,7a2 + /34,7 / 
/33,7a^ + /34,7a^ + Pzja ^ 
^3,7a^ + Paj + ^5 ja 7^ 



(l,a,a^) 


1 /34,7 1 


+ (1,^7) 


1 /34,7 1 




/35,7 a /33,7 1 




1 



Thus, dnvin,t < wh{z) = 2. 
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E. Proof of Theorem \12\ 

Theorem [T2l is proved by induction on i. The codebook with d^jn ^ > for all i G T satisfies the feasible 
condition for i = 0. Assume that up to the (fc — l)th iteration, where < fc — 1 < |£| — n^, we can find local 
encoding kernels such that the feasible condition is satisfied for all i < k. In the /sth iteration, let e be the edge 
appended to Q^^^ to form Q'^ . We will show that there exists kg such that the feasible condition continues to hold 
for i — k. 

We first consider a sink node t for which edge e is not on any path from the source node s to t. (Such a sink 
node does not necessarily exist). For all C, x and z'' satisfying C|2l)-C|4| with k in place of i, we have 

(i^,'=(x, -z'=))\^ = iFt-\^, -^iz'')\'))\^ (69) 

7^ 0, (70) 

where (|69]) follows from (l44l i. and dTOl) follows from the induction hypothesis, i.e., the feasible condition is satisfied 
for i = k — 1 by noting w/f ((z'^)\'^) < w/f(z'^) < dt — 1 — |£|. Therefore, (|48]) holds for i = /c regardless of the 
choice of kg . 

For a sink node t such that edge e is on the jth edge-disjoint path from the source node s to t, we consider two 
scenarios for C, namely j E C and j ^ C. For all C satisfying C|2| and j E C, and all x and z'' satisfying Cj3]l and 
CHli for i = k, 

{Fl^i^x, -z'=))\^ = {Fi'~\x, -(z'=)\'=))\^ (71) 

7^ 0, (72) 

where dTTT l follows from ( |47] | and ( f72] i follows from the induction hypothesis using the same argument as the 
previous case. Therefore, (gUl again holds for i = k regardless of the choice of kg. 

For all C satisfying C© and j ^ C, all x satisfying Cj3]l and all z*^ satisfying C|4|i with i = k, ( l48T l holds for 
i = fc if and only if either 

(F,'=(x,-z'=))\^uW ^0 (73) 

or 

(F,'^-(x,-z'=)), ^0. (74) 
By (|47ll and (|46]l, (l73]l and (EUl are equivalent to 

(i^^i(x,-(z'^-)\'=))\^uW ^0, (75) 

and 

(xA^-i - (z*)\*))F^-ike - (z^-)fc ^ 0, (76) 
respectively. Note that kg is involved in ( |76] | but not in ( |75] |. 
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For an index set £ satisfying Cl2| and j ^ C, let be the set of all (x, z'^) that do not satisfy (|75] |. where x 
satisfies CO and z'' satisfies CSI for i = k. We need to find a proper kg such that for any (x, z'') e S^, (x, z''') 
satisfies ( |76] |. In the following technical lemmas, we first prove some properties of S^. 

Lemma 13: If the feasible condition holds for i — k — 1, then for any (x, z*^) e S^, wh{z^) = dt — 1 — \C\ and 

f roq/;- Fix (x, z*^) £ S^. If |£| = dj — 1, since wh{z^) < df-l-|£| = 0, the lemma is ti-ue. If < \C\ < dt — 1, 
we now prove that whUz'^)^'^) > dt — 2 — \C\. If whU'z'')'^'') < dt — 2 — \C\, by the assumption that the feasible 
condition holds for i ~ k — 1, 

iFt^-\^, -(z'=)\'=))^ufa} _^ Q (77) 
i.e., (x, z'^) satisfies ( fTsT l. a contradiction to (x, z*^) e E^. Therefore 

dt-l-\C\< ii;H((z'=)\") (78) 
< u'hIz''^) (79) 
<dt~l-\C\. (80) 

Hence, u'_f/((z'^)^'^) = wh{z'') = dt — I - \C\. This also implies that (z'^)fc =0. ■ 
Lemma 14: Let M be a matrix, and let j be a column index of M. If a system of linear equations a;M = 
with X as the variable has only the zero solution, then xMS^ = has at most a one-dimensional solution space. 

Proof: The number of columns of M is at least the number of rows of M, otherwise the system of linear 
equations .tM = cannot have a unique solution. Let m be the number of rows in M. We have rank(M) = m. 
Let Nu11(M\^) be the null space of defined as 

Nu11(M\^) = {x : xM\^' = 0}. 

By the rank-nullity theorem of linear algebra, we have 

rank(M\^') + dim(Null(M\j')) = m. 

Hence, 

dim(Null(M\^)) = rank(M) - rank(M\^) 
< 1. 

The proof is completed by noting that Nu11(M\^ ) is the solution space of xM\i = with X as the variable. ■ 
Lemma 15: Let p be an error pattern with |p| = dt — 1 — \C\, where < |£| < dt — 1. If the feasible condition 
holds for i = k — 1, the span of all (x, z*^) G E^ with z'^ e p* is either empty or a one-dimensional linear space. 
Proof: Consider the equation 

(F^i(x,-(z'=)\'=))\^ = (81) 
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with X G C and z*^ G p* as variables. Since C and p* are both vector spaces, dSTT l is a system of linear equations. 
By the assumption that the feasible condition holds for i = fc — 1, (ISTT i has only the zero solution. By Lemma [T4l 
the system of linear equations 

(F,'=-i(x,-(z'^)\^-))\^u<-''> =0, 

with X G C and z*^ G p* as variables, has at most a one-dimensional solution space. ■ 
Lemma 16: If the feasible condition holds for i = k — 1, there exist at most ^^'"'^'^•'■''"^ values of kg 

such that ( |76] l does not hold for some (x, z'^) G S^. 

Proof: For (xo,z§) G S^, by Lemma [T3] (zo)fc = 0. Thus, all the kg satisfying 

(xoAS(,) - (z§)\'^)F'=ke = (82) 
do not satisfy ( |76] | for (xq, Zq) G S£. To count the number of solutions of ( |82] |. we notice that 

(F^i(xo,-(zg)\^-))\^#0, (83) 
by the feasible condition holding for i = k — 1, and 

(F^i(xo,-(zg)\^-))\^ut^>=0, (84) 

since (xo,Zq) G S£. Thus, 

(F^i(xo,-(zg)\^)), = ((xoA^-^ - (zg)\'=)F'=-i(A,y)^), ^ 0, 

which gives a nonzero component of (xqAq^^^^ — (z§)\''")F'^'^^ corresponds to the edge that precedes edge e on 
the jth path from s to t. This shows that the components of (xqAq^^^^ — (zq)\''')F'^^^ corresponding to the edges 
in I(tail(e)) are not all zero. On the other hand, a component of kg can possibly be nonzero if and only if it 
corresponds to an edge in I(tail(e)). Therefore, the solution space of ke in ( [82] i is an pj^^^'^''^'^))! ^-dimensional 
subspace. 

By Lemma [T3l for each (x, z'') G S^, z*"' must match an error pattern p with \p\ ~ dt — 1 — \C\ and e ^ p. Since 
there are totally + fc — 1 edges in Q'' excluding e, there are (J^'l^^lj^j) error patterns with size dt — 1 — \C\. 

Consider an error pattern p with \p\ = dt — 1 — \C\ and e ^ p. By Lemma [TS] if (xq, Zq) G S£ with Zq G p*, 
all (x, z''") G with z*"' G p* can be expressed as (q!Xo,q:Zq) with nonzero a G F. Since we obtain the same 
solutions of ke in (l82T l when xq and Zq are replaced by axo and azg, respectively, for a particular pattern p, we 
only need to consider one (xo,Zq) G S£ with Zq G p*. 

Upon considering all error patterns p with \p\ ~ dt — 1 — \C\ and e ^ p, we conclude that there exist at most 
(d"liH£|)9''^'"'^''''^'~^ v^luss of ke not satisfying dZS]) for some (x,z'') G S^. ■ 

Considering the worst case that for all i G 7", edge e is on an edge-disjoint path from the source node s to sink 
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node t, and considering all the index set C with < |£| < dt — I and j ^ C for each sink node t, we have at most 



^ ^rt - l\ f Us + k - 1 



I J\dt-l-l 



^|I(mil(e))|-l 



EE 

teT 1=0 

vectors that cannot be chosen as kg. Note that ( |86] | is justified because < k — 1 < \£\ — Ug. Since g > 
J2teT ('^'df-'i"^)' there exists a choice of kg such that for all C satisfying and j ^ C, all x satisfying CQ, 
and all z*^ satisfying C|4|l for i = k, (|48] | holds for i — k. Together with the other cases (where the choice of kg is 
immaterial), we have proved the existence of a kg such that the feasible condition holds for i — k. 

VI. Concluding Remarks 

This work, together with the previous work ifTSl . gives a framework for coherent network error correction. The 
work 1 15 1 characterizes the error correction/detection capability of a general transmission system with network 
coding being a special case. The problems concerned here are the coding bounds and the code construction for 
network error correction. 

In this work, refined versions of the Hamming bound, the Singleton bound and the Gilbert- Varshamov bound for 
network error correction have been presented with simple proofs based on the distance measures developed in [l5\. 
These bounds are improvements over the ones in ||6l, ifTOl . ifTTl for the linear network coding case. Even though 
these bounds are stated based on the Hamming weight as the weight measure on the error vectors, they can also 
be applied to the weight measures in lfT2l . lfT6l . ifTTl because of the equivalence relation among all these weight 
measures (See IBl, 1281 ). 

Like the original version of the Singleton bound |l6l, ifTOl . the refined Singleton bound for linear network codes 
proved in this paper continues to be tight. Two different construction algorithms have been presented and both of 
them can achieve the refined Singleton bound. The first algorithm finds a codebook based on a given set of local 
encoding kernels, which simply constructs an MDS code when the problem setting is the classical case. The second 
algorithm constructs a set of of local encoding kernels based on a given classical error-correcting code satisfying a 
certain minimum distance requirement by recursively choosing the local encoding kernels that preserve the required 
minimum distance properties. 

There are many problems to be solved towards application of network error correction. Our algorithms require a 
large field size to guarantee the existence of network codes with large minimum distances. One future work is to 
consider how to relax this field size requirement. Fast decoding algorithms of network error-correcting codes are 
also desired. Moreover, network error correction in cyclic networks is sill lack of investigation. 
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